Some Issues on the Study of Vocal Tract Normalization
نویسندگان
چکیده
Vocal tract normalization (VTN) is an effective way to reduce inter-speaker variability mainly caused by variation of vocal tract shape among speakers of different genders and age groups. In this paper, some practical implementation issues of VTN are discussed. We adopted a method to train model and selected the proper normalization scales of different speakers. The acoustic model is estimated from the unnormalized acoustic vectors of large speakers by maximum likelihood training. Then we use the gender-independent model to select the proper normalization scales of different speaker. The above steps are repeated. For VTN in training, we discussed with the drift effect of the warp parameter with the increasing of the number of iterations and the number of mixtures of the acoustic model. We studied the distribution of the warp parameter of different genders and age groups. To facilitate the fast warp parameter selection process, we proposed a hierarchical method and compared with the traditional methods.
منابع مشابه
Effects of Voice Therapy on Vocal Tract Discomfort in Muscle Tension Dysphonia
Introduction: Patients with muscle tension dysphonia (MTD) suffer from several physical discomforts in their vocal tract. However, few studies have examined the effects of voice therapy (VT) on the vocal tract discomfort (VTD) in patients with voice disorders. Therefore, the aim of the present study was to investigate the effects of VT on the VTD in patients with MTD. Materi...
متن کاملAvicenna's Anatomical Legacy as Seen Through the Relevant Topics in Modern Anat-omy
Background: Makhaarej Al-Horouf, the study of speech sounds by Avicenna is a valuable piece of work in the study of speech sounds, which was written about ten centuries ago. It contains six chapters on sound, anatomy of vocal tract, and phonetics. It is amazing to find that Avicenna’s explanations are congruent with the findings of modern scholarship in relevant topics. The study was intended t...
متن کاملReal-Time Vocal Tract Length Normalization in a Phonological Awareness Teaching System
Speaker normalization in a speech recognition can significantly improve speech recognition accuracy. One such method, vocal tract length normalization (VTLN), is especially useful when the system has to work reliably for males, females and children. It is just this situation with our phonological awareness teaching system, the “SpeechMaster”, which aims at real-time phoneme recognition and feed...
متن کاملExtrinsic normalization for vocal tracts depends on the signal, not on attention
When perceiving vowels, listeners adjust to speaker-specific vocal-tract characteristics (such as F1) through "extrinsic vowel normalization". This effect is observed as a shift in the location of categorization boundaries of vowel continua. Similar effects have been found with non-speech. Non-speech materials, however, have consistently led to smaller effectsizes, perhaps because of a lack of ...
متن کاملSpeaker normalization based on frequency warping
In speech recognition, speaker-dependence of a speech recognition system comes from speaker-dependence of the speech feature, and the variation of vocal tract shape is the major source of inter-speaker variations of the speech feature, though there are some other sources which also contribute. In this paper, we address the approaches of speaker normalization which aim at normalizing speaker's v...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002